Julian McAuley

Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism

May 29, 2026
Via arXiv

Live Music Diffusion Models: Efficient Fine-Tuning and Post-Training of Interactive Diffusion Music Generators

May 21, 2026
Via arXiv

Auto-Dreamer: Learning Offline Memory Consolidation for Language Agents

May 20, 2026
Via arXiv

F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking

May 13, 2026
Via arXiv

MLPs are Efficient Distilled Generative Recommenders

May 12, 2026
Via arXiv

FERA: Uncertainty-Aware Federated Reasoning for Large Language Models

May 11, 2026
Via arXiv

MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization

May 11, 2026
Via arXiv

Skill-R1: Agent Skill Evolution via Reinforcement Learning

May 10, 2026
Via arXiv

Expressiveness Limits of Autoregressive Semantic ID Generation in Generative Recommendation

May 07, 2026
Via arXiv

From Local Indices to Global Identifiers: Generative Reranking for Recommender Systems via Global Action Space

Apr 28, 2026
Via arXiv